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Goal  and  Objectives 

The  goal  of  this  project  was  to  understand  how  to  create  intelligent  agents  that  can  learn  by  reading,  in 
order  to  perform  robust  commonsense  reasoning  and  interact  effectively  with  human  partners.  This 
requires  deep  understanding  of  the  conceptual  knowledge  conveyed  by  texts  and  constructing 
knowledge  that  can  be  used  in  subsequent  reasoning,  by  contrast  with  shallow  information-retrieval 
approaches.  It  also  requires  the  ability  to  incrementally  adapt  to  new  language  usage  by  their  partners, 
and  doing  so  rapidly,  with  only  a  small  number  of  examples  (unlike,  say,  "deep  learning"  systems  that 
require  orders  of  magnitude  more  examples  than  analogical  learning  systems).  Our  overall  hypotheses 
are  that  analogical  processing  plays  multiple  roles  in  learning  by  reading,  and  that  qualitative 
representations  provide  a  foundation  for  common  sense  knowledge  that  can  be  used  for  reasoning  and 
as  part  of  the  semantics  for  natural  language  (Forbus  et  al.  2015).  Our  accomplishments  on  this  project 
provides  strong  evidence  for  these  hypotheses. 


Summary  of  Accomplishments 

Narrative  Function  &  Qualitative  Representations  in  Semantics 

The  idea  of  narrative  function  is  to  drive  understanding  of  a  text  via  context,  i.e.  what  should  a  reader  be 
taking  away  from  reading  this  text?  Narrative  function  can  be  seen  as  a  kind  of  communication  act,  but 
the  idea  goes  a  bit  beyond  that.  Communication  acts  are  often  coarse-grained,  e.g.  INFORM,  QUERY. 
Narrative  functions  help  bridge  between  the  representations  naturally  produced  via  lower  levels  of 
language  understanding  and  the  rich  conceptual  representations  that  need  to  be  inferred  from  them. 
Narrative  functions  provide  queries  that  can  be  asked  of  texts.  For  example,  in  learning  qualitative 
models  from  texts,  our  system  begins  by  looking  for  occurrences  of  quantities,  then  relationships 
involving  them  (e.g.  changes,  ordinal  relations,  causal  influences)  and  finally  processes  that  connect 
them.  These  queries  operate  abductively,  in  that  they  are  capable  of  making  assumptions  based  on  the 
alternatives  that  are  provided  by  the  initial,  underspecified  semantic  representations  produced  by  our 
parser.  The  use  of  abduction  for  this  kind  of  semantic  interpretation  is  standard,  but  our  contextualized, 
knowledge-level  guidance  of  the  process  is  novel. 

We  have  shown  that  our  account  of  narrative  function  can  extract  instance-level  qualitative 
representations  from  texts.  In  addition  to  handling  the  eight  examples  from  our  prior  work,  the  system 
was  tested  on  nine  simplified  English  paragraphs  from  a  science  book.  On  this  small  corpus,  it  achieved 
a  precision  of  0.74  and  recall  of  0.45  (McFate  et  al.  2014).  Furthermore,  we  expanded  our  account  of 
narrative  functions  to  combine  type-level  as  well  as  instance-level  qualitative  representations  (McFate 
et  al.  2013,  2014).  Instance-level  qualitative  representations  concern  specific  situations,  e.g.  "Niagara 
Falls  produces  5.4  gigawatts  of  electricity".  Type-level  qualitative  representations  concern  general 
information,  e.g.  "Solar  panels  produce  electricity."  Flexible  understanding  of  texts  requires  handling 
both  kinds  of  knowledge,  e.g.  instructional  texts  invariably  combine  both,  to  provide  both  general 
principles  and  to  tie  them  into  specific  examples  to  help  ground  them  for  learners.  For  our  systems, 
such  examples  provide  base  domains  for  analogies  to  new  situations,  as  well  as  inputs  to  analogical 
generalization. 
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We  have  developed  a  systematic  mapping  between  constructs  of  QP  theory  and  FrameNet  (McFate  & 
Forbus,  2015),  which  is  interesting  for  two  reasons.  First,  the  ability  to  create  such  a  mapping  provides 
further  evidence  that  QP  theory  can  play  an  important  inferential  role  in  natural  language  semantics. 
Second,  since  FrameNet  provides  a  very  broad  coverage  of  English,  tying  syntactic  constraints  on 
FrameNet  frames  (called  valence  patterns)  to  particular  QP  theory  representations  sets  the  stage  for 
radically  expanding  our  coverage  of  natural  language  semantics  for  learning  by  reading.  This  analysis 
also  provides,  for  the  first,  time,  methods  of  identifying  limit  points  in  natural  language  texts.  (In  QP 
theory,  a  limit  point  is  a  threshold  where  activity  changes,  e.g.  boiling  point  or  pressure  equalization.) 

To  test  this  analysis,  we  analyzed  a  new  corpus  of  science  articles  drawn  from  the  Simple  English 
Wikipedia,  and  found  that  57%  of  the  sentences  could  be  interpreted  via  QP  semantics  (McFate  & 

Forbus,  2016b).  Qver  half  of  those  involved  lexical  units  (e.g.  "flow")  that  evoked  continuous  processes. 
This  provides  further  evidence  for  the  importance  of  qualitative  representations  as  a  constituent  of 
natural  language  semantics.  These  techniques  enabled  us  to  successfully  extract  knowledge  from 
chapter-length  texts  (McFate  &  Forbus,  2016c). 

Qualitative  Reasoning  and  Science  Tests 

Elementary  school  science  tests  provide  a  useful  way  to  measure  some  aspects  of  commonsense 
knowledge.  By  analyzing  a  corpus  of  exams,  we  found  that  29%  of  such  questions  involve  qualitative 
reasoning  (supporting  our  hypothesis  of  its  importance  in  commonsense).  In  this  subset,  13%  appeared 
to  be  handled  by  standard  qualitative  reasoning  techniques,  and  19%  require  several  kinds  of 
extensions,  such  as  learning  specific  patterns  (e.g.  food  webs,  life  cycles).  We  demonstrated  that  the 
standard  QR  questions  can  indeed  be  solved  by  off-the-shelf  techniques,  using  all  31  questions  of  that 
type  from  the  corpus  (Crouse  &  Forbus,  2016). 

Analogical  Word  Sense  Disambiguation 

We  showed  that  analogy  could  be  used  for  both  word  sense  disambiguation  and  parse  tree 
disambiguation.  To  record  a  decision  that  a  specific  occurrence  of  a  word  (e.g.  "star")  denoted  a  specific 
conceptual  sense  (e.g.  famous  human,  geometric  shape,  astronomical  body,  as  drawn  from  a  large  off- 
the-shelf  knowledge  base),  our  system  automatically  constructed  cases  during  natural  language 
understanding  that  combined  the  syntactic  and  semantic  information  it  generated  that  supported  that 
decision.  When  understanding  subsequent  texts,  those  cases  are  retrieved  and  used  to  provide 
evidence  for  interpreting  the  new  text.  Qur  method  gets  accuracies  in  the  same  range  as  state  of  the  art 
word  sense  disambiguation  algorithms,  but  requires  very  little  data.  During  this  project,  an  expanded 
version  of  the  conference  paper  on  analogical  word  sense  disambiguation  appeared  in  the  journal 
Advances  in  Cognitive  Systems  (Barbella  &  Forbus,  2013). 

Knowledge  Segmentation  for  Analogy-based  Learning 

Flow  should  a  system  that  learns  by  reading  organize  what  it  learns?  Qbvious  possibilities  are  to  store 
the  interpretation  of  each  sentence,  paragraph,  chapter,  or  even  larger  units  as  a  bundle  of  knowledge. 
While  those  surface-level  organizations  are  valuable  for  readers,  they  are  only  indirect  markers 
concerning  what  aspects  of  the  underlying  content  are  conceptually  connected.  A  topic  might  be 
introduced  over  multiple  sentences  within  a  paragraph,  with  the  rest  of  the  paragraph  being  about 
something  else,  for  example.  Given  our  hypothesis  that  analogy  provides  a  basis  for  human-like  robust 
intelligence,  it  makes  sense  to  think  about  how  to  organize  incoming  knowledge  in  terms  of  cases.  We 
call  this  the  knowledge  segmentation  problem  (Barbella  &  Forbus,  2015).  Qur  approach  is  to  pick  out  key 
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concepts  and/or  entities  from  the  understanding  of  the  text,  and  gather  up  into  a  case  the  facts  that 
mention  it  {fact-based  segmentation).  Such  cases  can  then  be  used  directly  for  analogical  reasoning 
(e.g.  answering  questions  about  how  two  things  are  similar  or  different)  and  for  analogical  learning  (e.g. 
building  up  a  general  model  of  a  concept  via  constructing  generalizations  based  on  multiple  examples  of 
it).  Our  fact-based  segmentation  algorithm  out-performed  sentence-based  algorithms  in  terms  of 
accuracy  on  a  compare  &  contrast  task  based  on  learned  knowledge.  Interestingly,  a  paragraph-based 
segmentation  algorithm  produced  results  that  were  worse,  but  not  statistically  significantly  so,  from 
fact-based  segmentation.  On  the  other  hand,  extra  facts  in  the  paragraph-based  algorithm  led  to  over 
twice  as  many  analogical  matching  errors  (Barbella  &  Forbus,  2016).  This  provides  evidence  in  favor  of 
fact-based  segmentation  for  organizing  learned  knowledge  for  subsequent  analogical  processing. 

Learning  Constructions  via  Analogy 

We  developed  an  analogical  model  for  learning  the  use  of  denominalized  verbs,  e.g.  "She  spooned  me 
some  sugar."  Instead  of  introducing  a  new  verb,  people  seem  to  recognize  the  intended  meaning  via  a 
double-object  construction,  which  they  associate  with  transfer  events.  Given  just  21  manually 
annotated  sentences,  a  Companion  was  able  to  learn  the  double-object  construction  and  then  use  it  to 
successfully  interpret  novel  verbs  that  people  are  able  to  interpret,  as  determined  by  a  psycholinguistic 
study  conducted  elsewhere  (McFate  2016a). 

Other  progress  on  analogy 

We  published  an  extended  paper  on  the  current  version  of  the  Structure-Mapping  Engine,  our  model  of 
analogical  mapping  and  inference,  along  with  the  source  code  and  a  corpus  of  over  5,800  comparisons 
to  encourage  research  in  this  area  (Forbus  et  al.  2016). 

In  collaboration  with  researchers  from  Google,  we  showed  that  analogical  matching  over  relational 
representations  produced  by  Stanford's  Core  NLP  parser,  combined  with  attributed  relational  graphs 
that  combine  structured  information  with  word2vec  statistics,  can  achieve  results  competitive  with  the 
state  of  the  art  in  the  Microsoft  Paraphrase  task  (Liang  et  al.  2016).  This  also  illustrates  that  our 
approach  does  not  depend  on  a  particular  ontology  or  knowledge  base,  but  is  general  across  relational 
representations. 

Part  of  our  long-range  goal,  into  which  this  project  feeds,  is  achieving  human-level  Al  via  constructing 
software  social  organisms.  We  expanded  our  formulation  by  identifying  substrate  capabilities  needed 
for  software  social  organisms,  based  on  research  in  other  areas  of  cognitive  science  (Forbus,  2016). 

Technology  Transfer 

Our  natural-language  system  has  been  used  by  three  other  projects  in  our  group: 

1.  ONR  Socio-Cognitive  Architectures  Program:  We  are  using  natural  language  and  sketching  to 
communicate  simple  tasks  to  Companions,  with  the  goal  of  being  able  to  instruct  them  to  perform  in 
a  variety  of  psychological  experiments,  entirely  through  these  natural  modalities.  We  are  also 
exploring  learning  social  reasoning,  theory  of  mind,  and  moral  reasoning  through  natural  language 
understanding  of  microstories:  Short  snippets  of  text  that  provide  pre-segmented  information  that 
might  be  combined  via  analogical  generalization  into  useful  knowledge. 

2.  AFOSR  Machine  Intelligence  Program:  We  developed  the  idea  of  type-level  qualitative 
representations  in  this  project,  which  we  are  applying  more  broadly  here.  That  project  is  also 
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exploring  the  use  of  natural  language  advice  (e.g.  "Adding  a  university  to  a  city  increases  its  science 
output.")  and  labeling  events  during  learning  by  demonstration  (e.g.  "That  was  bad."  when  the 
trainer's  attack  failed)  as  part  of  exploring  how  to  extend  Companions  to  learn  as  apprentices 
working  with  people  in  a  simulated  environment. 

3.  NSF  Spatial  Intelligence  and  Learning  Center:  We  are  exploring  how  to  do  interactive  multimodal 
knowledge  capture,  using  a  combination  of  simplified  English  and  sketches  created  with  CogSketch 
(the  open-domain  sketch  understanding  system  developed  in  SILC),  focusing  on  analogies  commonly 
used  in  middle-school  science  education. 

By  contrast,  our  goal  in  this  project  was  to  develop  a  scientific  understanding  of  how  to  learn  deep 
conceptual  knowledge  via  reading,  and  to  be  able  to  use  that  knowledge  to  reason  robustly  about  the 
everyday  world.  We  believe  that  analogical  learning  provides  a  more  human-like  (and  hence  more 
scalable  and  reliable)  means  of  learning  from  reading  than  alternatives,  such  as  extracting  ground  facts 
via  information  extraction. 

We  have  three  ongoing  industrial  transitions: 

1.  We  are  collaborating  with  IBM  T.J.  Watson  Research  Center,  on  refining  our  knowledge  base 
materials  into  an  open-license  resource  that  can  benefit  the  community  more  broadly,  and  building 
a  next-generation  reasoning  system  that  relies  entirely  on  open-license  software. 

2.  With  the  help  of  Forbus  and  one  of  his  students,  Google  built  their  own  version  of  SME  for  in-house 
experiments.  This  led  to  a  joint  publication  in  IJCAI  on  its  use  in  the  Microsoft  Paraphrase  task. 

3.  Forbus'  group  is  a  Founding  Academic  Partner  in  the  Platform  for  Situated  Intelligence  consortium 
being  created  by  Microsoft  Research.  This  consortium  is  providing  us  with  new  multimodal  dialogue 
capabilities  which  we  are  using  in  Companions,  as  well  as  an  opportunity  to  disseminate  our 
research  even  more  widely. 
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